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(57) Abstract: The present in- 
vention uses spectroscopic tissue 
volume measurements using 
non-ionizing radiation to detect 
pre-disease transformations in 
the tissue, which increase the risk 
for this disease in mammals. The 
method illuminating a volume 
of selected tissue of a mammal 
with light having wavelengths 
covering a pre-selected spectral 
range, detecting light transmitted 
through, or reflected from, 
volume of selected tissue, and 
obtaining a spectrum of the 
detected light. The spectrum of 
detected light is then represented 
by one or more basis spectral 
components, an error term, and an associated scalar coefficient for each of the basis spectral components. The associated scalar 
coefficient is calculated by minimizing the error term. The associated scalar coefficient of the each of the basis spectral components 
is correlated with a pre-selected property of the selected tissue known to be indicative of susceptibility of the tissue for the 
pre-selected disease to obtain the susceptibility for the mammal to developing the pre-selected disease. 




wo 03/043492 



PCT/CA02/01771 



OPTICAL TRANSILLUMINATION AND REFLECTANCE SPECTROSCOPY 

TO QUANTIFY DISEASE RISK 

CROSS REFERENCE TO RELATED UNITED STATES PATENT 
5 APPLICATION 

This patent application reiates to United States provisional patent 
application, Serial No. 60/331,633, filed on November 20, 2001, entitled 
OPTICAL TRANSILLUMINATION SPECTROSCOPY to quantify Disease 
10 Risk. 



FIELD OF THE INVENTION 

The present Invention reiates to the use of optical 
transillumination/reflectance spectroscopy as a diagnostic for detemiination of 
15 the risk for diseases. 

BACKGROUND OF THE INVENTION 

As eariy cancer or disease detection and diagnosis make further 
inroads into clinical practice, the high cost of cunrent disease screening 

20 techniques redirected the focus of the Investigators towards methods capable 
of quantifying the risk towards these diseases as a practical pre-screening 
tool. One particular application area Is epidemiological science and public 
health research into cancer prevention, so the invention is applicable for a 
wide variety of chronic or slowly developing diseases such as Alzheimer's, or 

25 Multiple Sclerosis. However, prevention required identification of the 
population at risk in association with an appropriate prevention or risk 
reduction intervention for example through the control of disease inducing 
agents or lifestyle changes (exercise, diet etc.) An example risk quantification 
is the work related to assessing the breast cancer risk In the general 

30 population or subgroups thereof by Boyd, Yaffe et al. (1-3) which showed that 
X-ray density pattems are identified as having one of the higheist odds ratios 
towards the risk of breast cancer between the low risk and the high risk 
groups. For the specific case of breast cancer, radiologically dense breast 
tissue on mammography indicates the presence of stromal and epithelial 

35 tissue In the breast, the amount of which is strongly related to risk of breast 
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cancer, with increasing amount of radiologically dense tissue related to 
increased risk. The ability to assess the breast cancer risk enables new steps 
in cancer prevention, for example through lifestyle and dietary changes (4). 
One of the major disadvantages of the current standard for Breast 

5 Cancer Risk assessment is the use of ionizing radiation. This results in the 
late introduction of this diagnostic modality during the life of a woman, due to 
the inherent risk when using ionizing radiation in a diagnostic modality. Other 
good risk predictors are in general also only available once a woman has 
reached around 40 years of age, such as cancer incidence in first degree 

10 relatives (mother and sisters). However, the late onset in using these risk 

assessment modality will reduce the available time frame for any intervention 
aimed at reducing the disease risk. Hence, there is a clear need for a non- 
ionizing modality which can be employed in young patients, here in post 
puberty pre-menopausal women . 

15 Non-ionizing radiation was employed iri various optical mammography 

approaches, usually in attempts to Image the breast, and to detect breast 
lesions (5,6) commonly using frequency domain technologies at only a few 
specialized wavelengths, or as spectroscopic approaches for the determining 
the tissue optical properties of normal versus malignant breast tissue (7). 

20 These spectroscopic applications, including an article by Egan and Dolen (8) 
are rather intended for determination of the probability for the presence of 
cancer, but do not address the concept of risk assessment, e-g. as a pre- 
screenlng tool. 

United States Patent No. 6,1 21 ,775 Is directed to an MRI imaging 
25 method and apparatus and provides a physical interrogation methods related 
to detecting small changes in tissue. 

United States Patent No. 5,079,698 is directed to a transillumination 
method and apparatus for the diagnosis of breast tumors and other breast 
lesions by normalization of an electronic image of the breast. Unites States 
30 Patent No. 6,002,958 is directed to a method and apparatus for diagnostics of 
internal organs. This patent teaches the use of NIR radiation In the 0.6-1 .5 um 
wavelength range and adds ultrasound to the analysis tools. These two 
patents specifically create images of the organ 
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United States Patent No. 6,095,982 discloses a spectroscopic method 
and apparatus for optically detecting abnormal mammalian epithelial tissue' 
covers only Raman and fluorescence methods. United States Patent No. 
6,069,689 discloses an apparatus and methods relating to optical systems for 
5 diagnosis of skin diseases while very generally written and addressing, 

reflectance, fluorescence and Raman, using a plurality of light emitting diodes. 
While some changes in the tissue (skin) are mentioned the idea of risk 
assessment is not included in this or any other patent related to the use of 
non-ionizing radiation, 

10 

SUMMARY OF THE INVENTION 

The present invention uses spectroscopic tissue volume 
measurements using non-ionizing radiation to detect pre-disease 
transformatiorls in the tissue, which increase the individuals risk for this 
15 disease. 

In one aspect of the present invention there is provided a method for 
assessing susceptibility for developing a pre-selected disease in a mammal, 
comprising: 

a) illuminating a volume of selected tissue of a mammal with light . 
20 having wavelengths covering a pre-selected spectral range; 

b) detecting light transmitted through, or reflected from, said volume of 
selected tissue, and obtaining a spectrum of said detected light; 

c) representing the spectrum of detected light by a set of basis spectral 
components, an error term, and an associated scalar coefficient for each 

25 basis spectral component in said set, tlie set of basis spectral components 
including at least one basis, spectral component, the associated scalar 
coefficient for each basis spectral component being calculated by minimizing 
the error term; and 

d) correlating the associated scalar coefficient for each spectral 

30 component with a pre-selected property of the selected tissue known to be 
indicative of susceptibility of the tissue for the pre-selected disease to obtain 
the susceptibility for the mammal to developing the pre-selected disease. 
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In another aspect of the invention there is provided an apparatus for 
assessing susceptibility for developing a pre-selected disease In a mannmai, 
comprising: 

a) holder means for holding and immobilizing an anatomical part of a 
5 mammal containing tissue to be optically probed; 

b) light source means for illuminating a volume of selected tissue of a 
mammal with light having wavelengths covering a pre-selected spectral 
range; 

c) detection means for detecting light transmitted through, or reflected 
10 from, said volume of selected tissue; 

d) computer control means connected to said detection means for 
producing a spectrum, of said detected light from an output of said detection 
mean, the computer control means including processing means for 
representing the spectrum of detected light by a set of basis spectral 

15 components, an error term, and an associated scalar coefficient for each 

spectral component In said set, the set of basis. spectral components including 
at least one basis spectral component, the associated scalar coefficient for 
each basis spectral component being calculated by minimizing the error term, 
the processing means includes means for conrelating the associated scalar 

20 coefficient for each basis spectral component with a pre-selected property of 
the selected tissue l<nown to be indicative of susceptibility of the tissue for the 
pre-selected disease to obtain the susceptibility for the mammal to developing 
the pre-selected disease, the computer control means including display 
means for displaying the susceptibility. 

25 . 

BRIEF DESCRIPTION OF THE DRAWINGS 

The method and apparatus constructed in accordance with the present 
invention will now be described, by way of example only, reference being iiad 
to the accompanying drawings, in which: 
30 Figure 1 shows the block diagram of one possible embodiment of an 

apparatus produced in accordance with the present invention; 

Figure 2 show a reconstruction of an experimentally obtained optical 
transillumination spectrum by a linear combination of either only the first two 
or four principle components. 
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Figure 3 shows an example of a typical set of measurements 
comprised of eight spectra from a volunteer representing the four quadrant of 
the breast on either side of the bilateral organ; 

Figure 4 shows the correlation of the ti and ta scores from the spectra 
that were repeatedly measured resulting in a regression slope and correlation 
coefficient close to unity; 

Figure 5 showing the first 4 components obtained from the PCA 
following data pre-processing which included thickness and transfer function 
correction, note that component 2 show inverse absorption for the lipid and 
water peaks respectively; 

Figure 6 shows the resulting weights or loading factors ti vs. for the 
first 2 principle components for thickness and transfer function correction as 
pre-processing option, square and rhombus symbols represent high and low 
breast density subjects, respectively; 

Figure 7 shows the first four components obtained from the PCA 
following data pre-processing which includes only transfer function correction; 

Figure 8 shows the resulting weights or loading factors ti vs. t2 for the 
first two principle components using transfer function correction as only pre- 
processing option, square and rhombus symbols represent high and low 
breast density subjects, respectively; 

. Figure 9 shows the first 4 components obtained from the PCA following 
data pre-processing which includes thickness and transfer function. correction 
followed by autoscaling of the data; 

Figure 10 shows the resulting weights or loading factors ti vs. t2 for the 
first 2 principle components using thickness and transfer function correction 
followed by autoscaling of the data, square and rhombus symbols represent 
high and low breast density subjects, respectively; 

Figure 1 1 assigns physical meaning to the four quadrants in the ti vs. 
t2 plots, shown here for thickness and transfer function corrected and 
autoscaled data 

Figure 12 shows an identification of spectra form low and high density 
breast-tissue within a 3D plot of vs. t2 and ta using thickness and transfer 
function correction as pre-processing options; 
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Figure 13 shows the predicted % parenchymal tissue density based on 
PLS analysis of the optical transillumination spectra versus percent density 
according computer assisted analysis of the mammograms, solid symbols 
refer to the training set and open symbols to the validation set; 
5 Figure 14 shows spectrally resolved transillumination and reflectance 

measurements are obtained by the used of one of more appropriate VIS/NIR 
light sources and opto-electronic detector systems measuring the surface 
proximal volume layer and the total volume separately; and 

Figure 1 5 shows an example of a three wavelength frequency domain 
10 system to determine the differential path-length factor of photons traversing 
tissue. 

DETAILED DESCRIPTION OF THE INVENTION 

Referring to Figure 1 shows an apparatus 10 used in the present 

15 invention comprises a light source 12, waveband selection devices, delivery 
optical fibre 14, tissue support 16, collection optical fibre bundle 18, 
wavelength dispersing element, opto-electronic detector 20 and CPU 22. 
Spectra can be corrected for the wavelength dependent transfer function of 
the instrument and the physical interoptode distance. It will be understood that 

20 this is not required as it can also be convoluted into the numerical correlation 
analysis. Additionally, the optical fibre 14, the tissue support and the optical 
fibre bundle is optional. The wavelength selective device can be either prior to 
or post the interrogated tissue. 

For use in studies related to breast tissue, the light source 12 was a 12 

25 Watt halogen lamp (Welch Allyn) with stabilized power supply serving as a 
broadband light source. The ultraviolet to green and mid infrared radiation 
were eliminated using a cut-off filter (at 500 nm) and a heat filter, respectively. 
The remaining light was coupled into a 5mm diameter liquid light guide, which 
served as the source fiber in contact with the skin on the top of the breast. A 

30 total power of 0.25 Watt, covering the 550 - 1300 nm bandwidth, was 

delivered to the skin surface. The holder of the source fiber and the plate in 
which the detector fiber was embedded were made of black plastic to absorb 
reemitted photons . The source fiber was in firm contact with the tissue and 
slightly compressed it.by 5mm, The source and detector fibers (optodes) were 
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held coaxially, pointing towards each other, via a custom-made calliper 
attached to the resting platform. The calliper allowed the measurement of the 
inter-optode distance (cm) and hence, the physical thickness of the 
interrogated tissue. Transmitted light was collected via a custom-made 7 mm 
diameter fiber bundle (P&P Optical Kitchener, Ontario, Canada). Wavelength 
resolved detection in the visible-near infrared was achieved using a 
holographic transillumination grating (15.7 rules/mm blazed at 850 nm) 
(Kaiser, California, USA) and a 2D, cryogenically-cooled silicon CCD 
(Photometries, New Jersey, USA) at a spectral resolution of less than 3 nm 
between 625 nm and 1060 nm. This spectral resolution was achieved by 
positioning a 0.5 mm slit between the distal end of the collection fiber and the 
spectral grating. The peak quantum efficiency (>0.8) of the detector was at 
780 nm. falling to 0.2 at 1 100 nm. By imaging the entrance slit of the 
spectrograph onto the 2D CCD, 50 rows of pixels were effectively exposed to 
detected light, thus increasing the dynamic range of the electronic detection 
by up to 50 times. To minimized background noise, cryogenic cooling 
reduces the dark counts to -0.06 electrons per hour. Further noise reduction 
was achieved using exposure times of 2-3 seconds and averaging up to five 
scans. The system's dynamic range was >5 OD (optical densities) with a 
signal-to-noise ratio of >10'* at peak sensitivity and >10 across the remaining 
bandwidth of the system . 

In order to collect the transmitted light emerging from the tissue, a 
wavelength dispersing medium (in thie preferred embodiment a grating) for the 
various wavelength ranges use, if present, and an opto-electronic detector , 
here a Si based (0.65 pm to 1 .5 pm) photodiode array (PDAs) are used. The 
PDA's information is transferred directly or delayed to the computer containing 
a numerical analysis prograrh to quantify the disease risk using a previously 
trained algorithm. The correlation between spectral data and disease risk is 
based on the spectral attenuation, through photon absorption and scattering, 
ohiy. the particular disease risk assessment system comprises in its current 
version a continuous optical spectrum with wavelength in the 550. nm to 1000 
nm range. 

Different embodiments of the apparatus may include an InGaAs based 
(0.9 pm to 1 .7 pm), or equivalent opto-electronic detector to cover also the 
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longer wavelength range which is transmitted through tissue, or can comprise 
also only various wavelengths or wavelength bands within the VIS/NIR range, 
such as attainable with filters of direct emission devices with limited bandwidth 
such as laser, LED or similar. 

In another embodiment of the apparatus there is included at least a 

single 

(whereas three is preferred) wavelength, frequency domain transillumination 
system, spanning most of the steady state wavelength range for 
transillumination. In one non-limiting embodiment the three wavelengths may 
be 785, 808, 905 nm. The lasers in the frequency domain system need to be 
modulated between 10 to 400 MHz (215MHz in the present system) to obtain 
a good phase shift resolution for 5 cm physical pathlength between delivery 
and detection optode. The system should provide a phase shift resolution of 
better than 2°. The lasers used here provide ^200 mW average power at the 
distal end of the delivery fibers in contact with the patient. Intensity modulation 
is achieved by modulating the driving current of the lasers via a biased-T, a 
method to recapture the transmitted light and guide it to the PMT with fast 
time response, a preamplifier filtered to the appropriate modulation frequency, 
a lock-in detection to quantify the demodulation and the phase shift compared 
to a standard either from the laser driver current or a portion of the light prior 
to entering the tissue, and a data transfer system to a computer to translate 
the phase shift and demodulation data into a differential path length factor and 
scattering coefficient In frequency domain measurements various modi 
operandi can be employed one being heterodyne detection technique, which 
can use also very lower power lasers. Other methods to quantify the optical 
pathlength are described (10, 1 1) and include time resolved detection or time 
resolved single photon detection . 

The ability to quantify disease risk and hence the population members 
at risk can be increased through additional measurements such as skin 
reflectance measurements to be able to subtract the skin contribution to the 
optical transillumination spectrum, so clearly this information may not be 
required. Depending on the part of the body being diagnosed additional 
components such as anatomical supports may be included, for example if the 
breasts are being studied, a support stand for the breast, and a holder for the 

8 
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liquid liglit guide and optical fibres for light delivery, and a random fibre bundle 
for light collection. The latter may be a trifurcated fibre. bundle to deliver 
transmitted photons to the PMT (frequency domain measurements), Si-based 
and InGaAs-based PDAs respectively. Again these are to a large extent 
5 optional, for example one can envision a cup Iil<e device which holds the 

sources and detectors in direct contact with the tissue. A means to measure 
the angle and distance between the delivery and detection optodes equivalent 
to the physical tissue thickness or in the case of a reflectance measurement 
the interoptode distance as the latter will determine the tissue depth most 

10 likely interrogated by the photons. It is noted that the inventors have shown 
. some correlation and predictive abilities with thickness coFrectlon of the data. 
Software to extract spectral absorption and light scattering features to quantify 
the risk for a specific disease is loaded onto the CPU. Software was trained 
using data from clinical studies providing the optical spectra and an 

15 independent measure of the risk as discussed hereinafter. 

In order to correct for variations in the wavelength dependent signal 
transfer function of the system, all transillumination spectra were referenced 
to a dally collected transmission standard made of 1 cm thick, ultra-high 
density polyurethane (Gigahertz Optics, Munich, Germany). All measured 

20 spectra are thus given as wavelength dependent (rel OD). This referencing to 
a known standard yields a universal applicable dataset and hence the 
subsequent developed mathematical models correlating transillumination 
speqtra with risk will also become universally valid, that is independent of the 
actual instrument used to collect the data. 

25 The spectral volume measurements can be augmented by adding 

extended long wavelength (conriprising the NIR range transmitted through 
tissue, e.g. up to 1 .7 ^im) and frequency domain measurements. The former 
providing more Information about vibrational bands of the biomolecules In the 
interrogated volume. The latter enables by using a limited number of 

30 wavelengths (3 or more), to determine the differential pathlength factors of the 
tissue (6). Combining this with the transillumination spectra allows one to 
obtain absolute absorption spectra and subsequently derivation of the 
contributions of various chromophores responsible for the absorption spectra 
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measured providing additional information for the identification of the 
population at risk and also insight into the moiecuiar changes associated with 
or resulting from the tissue transformation. 

5 ANALYSIS OF REFLECTANCE OR TRANSILLUMINATION OPTICAL 

DATA FOR ASSESSMENT OF DISEASE RISK 

The following describes four nonllmiting examples of mathematical 
modelling techniques capable of establishing a correlation between the optical 

10 spectra and a particular outcome, in the present case risk of disease. This is 
not a complete or comprehensive list of ail the methods available which may 
be used and other methods such as hybrid linear analysis (12) are available 
and persons skilled in this type of analysis can identify other methods. These 
spectral analysis techniques have been used extensively in chemometrics 

15 field to solve for the concentration of a constituent of interest without knowing 
the spectra of all constituents present in a chemical sample (13) or where 
multiple chemicals of interest have overlapping spectra, (14). These analysis 
techniques constmct models that identify the variance within the spectral data 
set or, when trained with a known range of concentrations, can identify the 

20 variance that is relevant to the constituent(s) of interest. Through training the 
model derives component spectra that resemble the constituent spectra within 
the sample, and can later be used to predict the concentrations of 
constituents from new sample spectra. However, these component spectra . 
may not have direct physical meaning but can represent the spectral features 

25 of the pure constituent spectra. 

In the case of breast cancer which is exemplified hereinafter, it has 
been shown by Thompson and Tatman (9) that the tissue composition will 
change slowly as the breast is undergoing sequential changes towards 
dysplasia, carcinoma //? situ and then invasive cancer, often involving or 

30 resulting in concentration change of different optically (light absorption or 
fluorescence) active molecules and/or structures (light scattering). 

In the specific case of breast cancer the idea is based on the fact that 
the same structures and chemicals, that give rise to the x-ray attenuation, the 
current (gold) standard for breast cancer risk assessment, will also result in 

10 
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changes in the transillumination spectra. These changes will be evident in the 
attenuation and/or scattering of the visible/NIR light due to different 
contributions of the optically active molecules. Correlation between the x-ray 
mammography and the spectral transillumination information can be 
5 established and quantified by a variety of different numerical methods, among 
which are principal component analysis or linear discriminate analysis when 
the breast density based on mammographic analysis is provided as 
classification (nominal data) or using partial least squares or principle 
component regression when the mammographic analysis is provided as % 
10 dense tissue (interval data). Besides x-ray mammographic analysis, other 

methods which may be used to obtain the parenchymal breast tissue density 
may include ultrasound, computed tomography, cone beam computed 
tomography, electrical impedance spectroscopy and magnetic resonance 
imaging, 

15 

Principal Component Analysis (PCA) and Linear Discriminant Analysis 
(LDA) 

PCA is optimized for comparison of vectors with nominal data, in this 
20 case correlating spectra to the density classification, thereby making use of 
the complete spectral information while reducing the dataset size. For this, 
PCA determines the amount of variance within the test group of spectra and 
uses it to iteratively reduce the dataset to representative spectra, called 
components. The spectral data are represented by a smaller number of 
25 vectors in a lower dimensional space while still including the maximum 

amount of variance within the data set. (1 5). This is accomplished by solving 
for the covariance or correlation matrix of the data matrix X(mxn) compnsed 
of the dataset of all spectra obtained (n) and the spectral range monitored 
(m), such that: 

30 .cov(X) = X"^X (1) 

n-1 

PCA decomposes the data matrix X as the sum of the outer products of the 
vectors t/ and p/ and a residual matrix E: 

35 
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X = ti p'^l + t2p'^2 + t3p"^3 + +t/p"^/+E 

or X = TP^+E (2) 

Where the elements of the t/ (n x 1) vectors are the scores that contain 
5 information on how the spectra relate to each other, and the p/ vectors (mxi) 
or components are the eigenvectors of the covariance matrix and show how 
the selected variance relates to each other. The components p / are 
eigenvectors of the covariance matrix, so that 

10 cov(X)p/=A,/ p/ (3) 

where X / is the eigenvalue associated with the eigenvector p /. Also for any X 
and t/ and p/ pair X p / = t /, (i.e. the score vector t / is the linear combination or 
representation of the original X defined by p /). It Is generally found that data 

15 can be described In fewer components p/ than original variables (m) and that 
the components can be combinations of variables that are useful descriptions 
of particular constituents In the tissue. As stated above, the shape of the 
useful components will be a combination of spectral signatures of 
chromophores that vary with tissue density. Scores (elements of t /) that 

20 differentiate between tissue densities identify useful components (p/). 

The scores (elements of t/) can be graphically plotted against one 
another to show any potential clustering of spectra that are related. In this 
study PCA was calculated oh the test set (2/3 of available spectra) and the 
same mathematical model, e.g. retaining the p/ was used to predict the scores 

25 t / on the validation set (1/3 spectra). 

Linear Discriminant Analysis (LDA) was used on the PCA scores to 
enhance the differentiation between the two extreme nominal categories (low 
vs high density), LDA finds a discriminant rule for defined, groups within the 
dataset. It is based on drawing the boundary halfway between the means on 

30 a pair wise basis (adjusted slightly If there is some prior probability of group 
Identity). It calculates the discriminant rule by computing optlnriizing factors 
{criterion j) that are based on the covariance matrix of the given groups. The 
targets that are used for training identify the density category group (1 6) 
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criterion j - inv (cov Xy) . S i, (4) 

Where y is the particular group and S ^ Is the between class variance defined 
by 

^»=Z(^/-^)' (5) 

J ■ 

In this study, the LDA algorithm was trained using the training dataset by 
leaving one spectrum out during each training run and then predicting it's 
tissue density group. The rest of the spectra from the validation set were then 
classified using the trained model. The LDA scores can also be plotted in a 
similar cluster diagram as the PCA scores (16)- Table 1 shows the key 
differencies between PCA and LDA. 

Statistical significance for the PCA and LDA prediction was established 
using measures similar to sensitivity and specificity values commonly used to 
evaluate the validity of diagnostic tests. As transillumination Is currently 
envisioned as a method for Identifying those women with high tissue density 
within the entire population, Increased HDM Is preferable over Increased 
LDM. Consequently, a high density measure (HDM) was used to assess the 
quality of the prediction model. HDM was defined as the ratio of those women 
that were predicted to have high density tissue with the optical 
transillumination method to those who were categorized as having high tissue 
density by the Radiologist. Conversely, the low density measure (LDM) is a 
measure of the ability to correctly identify those spectra that do not have high 
density. 

Table 1: Properties of Principal Component Analysis and Linear Discriminant 
Analysis. 
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Figure 2 shov\« a reconstruction of an eixperimentally obtained 
spectrum by eitlier oniy two or four principle components. 
Prfnclpal Component Regression and Partial Least Squares Analysis 

PGR and PLS are analysis techniques are optimized for comparison of 
vector inputs witli interval target data. For this case the spectra are compared 
to the percent density readings. Statistical significance was established using 
the Pearson con-elation coefficient as well as the slope and intercept of the 
regression ijne. Both PGR and PLS analysis require training and peri'orm 
regression analysis on the dataset, but use different types of variance 
Identification to predict the target variable, in this study the percent density 
read from the mammogram, see Table 2. 

As a first attempt for solving for a concentration of a coristituent. 
Inverse Least Squares analysis has been used for well behaved data, where, 
spectral features are not overiapped and predictions of the constituent is not 
effected by collinearity problenis v\^th other constituents in the sample. Here 
Y is defined as: 

Y = Xb (6) 
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Where b is the regression vector that is found by solving for the 
pseudolnverse of X. This cannot be used here since all the spectral features 
contributing to the spectral shape and that lead to density prediction are not 
defined so there could be collinearity problems. 

Principle Component Regression calculates principal components in a 
similar manner to Principal Component Analysis and then performs regression 
analysis of the scores matrix with the known targets or tissue densities from 
the training set. It does this by solving the following equation 

. Y = (P T^) b (7) 
where b = P (T*^ TT^ Y (8) 
which is similar for Inverse Least Squares analysis above in equation 7, 
except that X = PT^. The regression vector b is the least squares solution for 
T, where T are scores from the PCA and T is a(nxk) matrix, where n is the 
number of spectra and k is the number of factors used in the model. Y is the 
(n X 1) vector of targets, in this case percent density of the region of Interest of 
the mam mogram (17). 

For Partial Least Squares Analysis, the basis vectors or latent variables 
(equivalent to the principal components) are extracted using the related 
targets, such as percent density, by solving equation 7 and relating b to the 
targets directly by solving: 

b =W(P^W)-^(T^T)"^T'^Y (9) 
where W is a weighting vector that relates the target set (prediction values) 
with the variance In the dataset during the decomposition into factors. This 
allows the PLS algorithm to solve for components that are correlated with the 
targets (Y) while describing a large amount of the vsriation in X. (1 7) For a 
more detailed explanation please see Kowalski BR and GeladI P (1986),(17) 
Wise BM (2000),(15) and Haaland DM and Thomas EV (1988). (14) - 

With both PGR and PLS only.a single regression vector b is used in 
subsequent predictions of spectra. This b vector is selected by calculating the 
predicted residual error sum of squares (PRESS) value for each b/ using the 
training set. The chosen b vector has the fewest number of iterations (lowest 
value of i) and the lowest PRESS value when the training set is used to 
establish the modeil. 
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5 



Table 2: Properties of Principal Component Regression and Partial Least 
Squares Analysis. 
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10 

In this study the PRESS values were calculated for both PGR and PLS 
for the training sets, using the built-in cross-validation function in Matlab 6.0 
PLS toolbox. The b vector with the lowest PRESS value was sejected. The 
training set was randomized and split into four sections, the respective 
15 algorithms were then trained by a Venetian blind method with four repeats. 

The subsequent b vectors were then averaged and used to predict the tissue 
density for entire training set and subsequently the validation set. 

The method disclosed herein will now be described with a nonlimiting 
example of study in breast tissue, 

20 
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EXAMPLE 

BREAST TISSUE TRANSILLUMINATION STUDY 

Transillumination spectroscopy measurements were taken in a dark 
room with the volunteer seated comfortably, and the breast resting on a 
horizontal support (Figure 1). Total data acquisition time was approximately 
15 minutes allowing measurements at all 8 positions and necessary re- 
positioning of the optodes. There was little discomfort to the volunteers due to 
point compression of the breast by the source optode. 

Figure 3 shows an example of a typical set of measurements 
comprised of eight spectra from the volunteer. Spectra from the respective 
bilateral positions of the breast show symmetry, as expected In women with 
healthy breast tissue. In previous studies, this criterion of spectral symmetry 
in the same position of both breasts was used, by Egan and Dolan, to Indicate 
absence of breast cancer In the examined areas (8). The reproducibility of the 
optical transillumination measurements within one visit was analyzed by 
recruiting a volunteer to undergo multiple procedures. 

A total of 8 spectra were collected per volunteer representing the four 
quadrants (medial, distal, lateral and central) of both breasts. These spectra 
can be further pre-processed by correcting for tissue thickness (OD/cm) 
and/or auto-scaling prior to development and testing of the mathematical 
models used to then correlate with tissue density. In general for Principal 
Component Analysis and similar models, scaling Is used If the Input variables 
have different units or If there are large deviations in some input variables 
compared to other variables within the data set. 

For auto-scaling, the data was mean-centered, where the mean optical 
density value of the data set at each wavelength was subtracted from all 
spectra, and scaled to one unit variance by dividing all the values at each 
wavelength by the standard deviation for that wavelength. This was applied 
only to the training set data for PCA models developed, whereas scaling of 
the validation set the previous obtained riiean and standard deviation vectors 
were used. See Table 3 for pre-processing options used for the interpretation 
of the spectral data set. 
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Table 3. Pre-processing techniques of the spectral dataset and a reference 
number used in future sections. 



Pre-processing of Dataset 

Transfer function corrected I 
Thickness and transfer function 

II 

corrected 
Autoscaled and transfer 

III 

function corrected 



Autoscaled and thickness and 

IV 

transfer function corrected 

5 . 

A detailed example of establishing a correlation between optical 
spectra and disease risk follows for the example of PCA. It is generally found 
that data can be described in fewer components p/ than original variables 
(m=436 wavelength elements) and that the components can be combinations 

10 of variables that are useful descriptions of particular constituents in the tissue. 
As stated above, the shape of the useful components will be a combination of 
spectral signatures of chromophores that vary with tissue density. Scores 
(elements of t/) that differentiate between tissue densities Identify useful 
components (p /).The scores (elements of t /) can be graphically plotted 

15 against one another to show any potential clustering of spectra that are 

related- Here the PCA algorithm was trained on a test set (/7=544 spectra) 
and the same mathematical model, I.e. retaining the p/, was used to predict 
the scores t/ on the validation set (n=1 92 spectra). 

Statistical significance for the PCA prediction was established using 

20 high density measure (HDM) as it is preferable over increased low density 
measure (LDM) both are similar to sensitivity and specificity. As 
transillumination is envisioned as a method for identifying women with high 
tissue density within the entire population, improved HDM Is desired. 
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Consequently, HDM was used to assess the quality of the prediction 
model defined as the ratio of women predicted to have high density tissue to 
those who were categorized as having high tissue density by the radiologist. 
LDM is a measure of the ability to correctly identify those spectra that do not 
5 have high density. 

Figure 4 shows the correlation of the ti and t2 scores from the spectra 
that were repeatedly measured. Figure 4 also shows acceptable 
reproducibility with slopes of the regression and the values close to one. 
There is a spread in the component scores (t^ and t2), but there is a clustering 

10 of the position related data. 

Figure 5 shows the principal components (pi) from the PCA using n = 
544 spectra (thickness and transfer function corrected) obtained to date. 
These first four components contain 97.6 %, 1 .2 %, 0.6 % and 0.3 % of the 
variance in the total data set, respectively, for a combined total of 99.8% of 

15 the variance. The cluster plot of the scores for t| and t2 is shown in Figure 6. 

This plot illustrates discrimination of the breast tissue areas across a diagonal 
line in the ti vs. t2 space. 

Non-thickness corrected spectra were used to determine the effect of 
thickness on the robustness of the PCA model prediction. The component 

20 spectra for prP4 are very similar to the thickness-corrected component plot; 
see Figures 5 and 7 respectively. The ti vs. t2 plot, Figure 8, for this data 
shows discrimination as a function of t2 only. Similariy component spectra can 
be seen In Figure 9 for thickness and transfer function conrected data that 
were additionally autoscaled, with the ti vs. t2 cluster plot shown in Figure 10. 

25 The scores for both and t2 are centered on zero as expected but show 
similar clusters as Figure 8. 

HDM and LDM were determined to compare which spectral pre- 
processing option best differentiates the high and low density spectra in 
cluster plots according to Figure 6, 8 and 10. HDM and LDM data shown in 

30 the Table 4 were calculated from the same training and validation data sets 
separately. 
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Table 4: HDM and LDM of Principal Component Analysis results for test and 
validation set measurements. 



Pre-processing 
of Dataset 


Test set 
HDM LDM 


Validation Set 
HDM LDM 


Transfer function 












corrected 


1 


85% 


97.0 % 


88% 


90 % 


^Fiaurf^ 












Thickness and 












transfer function 












corrected 


II 


88% 


93 % 


93 % 


89 % 


(Figure 6) 












Autoscaled - 












transfer function 












corrected 


III 


86% 


94% 


. 90% 


86% 


(data not sliown) 












Autoscaled - 
tliicl<ness and 
transfer function 


IV 


87 % 


92% 


93% 


90% 



corrected 

(Figure 10) 



Autoscaling removes some spectral information since the mean 
spectrum is wavelength dependent- As the spectral features contributing to 
the discrimination between high and low breast density or risk are unknown, 
losing spectral information is very unwise. This is reflected in a reduced HDM 
and LDM as seen in Table 4 for pre-processing option III and IV. 

The scores tf and resulting from thickness corrected and non - 
con-ected component spectra demonstrated that it is possible to differentiate 
between the subjects having low or high breast tissue density subjects 
(Figures Sand 10). 

While mathematical models derived for thickness or non-thickness corrected 
optical spectra differentiate high and low breast tissue densities, their ti vs. 
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cluster plots differ. One explanation for the difference in the thickness 
corrected vs. non-corrected cluster plots is the effect of the physical tissue 
thickness on the overall variance within the spectral data set and therefore the 
determination of the components by the PCA algorithm. Thickness of the 
tissue contributes non-uhiformly to individual spectra as larger breasts tend to 
contain more fatty tissue. 

In the model of the data that was not thickness corrected, pi did not 
differentiate between high and low density tissue. This can also be seen in the 
magnitude of the in Figure 5 versus the non-thickness corrected component 
spectra in Figure 7. This indicates that the thickness values contribute to the 
magnitude of masking other contributions that do differentiate between 
tissue densities, such as light scattering. This leaves as the only 
component to preserve information to distinguish the density of the breast. 

When comparing the autoscaled versus non-autoscaled data, there 
were minimal changes in the principal component spectra and minor 
differences in HDM and LDM values, see Figures 6 and 10 and Table 4. 
Autoscaling as part of the pre-processing can degrade regions with flat or 
extreme spectral variation. (14) In this case, degraded spectral features could 
include regions of the spectrum with minimal wavelength dependence and 
hence a first derivative close to zero. For example, the haemoglobin inflection 
points are more pronounced in the non-autoscaled data than In the 
autoscaled components. Conversely, the large spectral features of water and 
lipids are large compared to other structures In the spectra, but are less 
pronounced after autoscaling. In this study, the only differences in the HDM 
values are in the training set with the non-autoscaled data having almost 2% 
higher scores for both HDM. and LDM. 

Principal components can reveal particular regions of the spectrum that 
represent important physical properties or entities within the tissue that 
contribute to differentiation. Component spectra 1 (p^) and 2 (ps) are the 
most important cover the highest amount pf variance in the data set. While 
components 3 and 4 have similar or inverse shape as component 2 they take 
less variance into account. 

Principal component 1 (pt) has a small dependence on wavelength and 
has negative values, in each prepared model. The spectrally flat 
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characteristics resulting from thickness corrected spectra can be attributed to 
attenuation due to absorption, scattering, and, therefore optical pathlength 
and losses at the tissue boundary. The surprisingly flat spectral shape of the 
scattering contribution is due to the derivation of OD used here, based on the 

5 wavelength dependent transfer function calibration by a polyurethane block 
which exhibits also high Mie scattering, e.g. Mie scattering is present in the 
spectrum and hence cancels itself out. We propose that carries optical 
pathlength information despite not showing the typical dependency, (18) 
but contains information to determine breast density through the overall 

10 scattering power. Low density tissue spectra have a reduced amount of 

scattering compared to high density tissue, and, therefore, higher values of 
in Figures 8 and 10. This relationship in scattering properties is also seen in 
the scattering coefficient data by Peters et aL (19) and Troy et al. (20) 
supporting this interpretation of the first principal component. 

15 The other component enabling differentiation between low and high 

tissue densities p2, has a more complex shape when compared to pi . The 
most important spectral features in the component are the lipid with inverse 
water peaks present at 930 nm and 980 nm, respectively. When t2 values are 
positive, the lipid peak is the dominant spectral feature as anticipated for fatty 

20 or low density tissue. Spectra from the high density tissue have negative t2 
values and water absorption becomes the dominant structure in the 
component spectrum. Graham et aL (21 ) also observed this relationship 
between water and density values when using MRI to quantify percent 
density. In their study the water content of the tissue was measured directly 

25 and showed good correlation to percent tissue density (r = 0;79). (21 ) 

Spectral feature contributions by haemoglobin can be seen between 
625 and 850 nm within P2 where the negative slope and inflection points of the 
haemoglobin curve are apparent. Dense breast tissue has large negative 
scores (t2) compared to the low density tissue. Haemoglobin contributes to 

30 the variance in P2, with elevated contribution of blood related absorption for 
high wgter absorption seen in the higher density tissue. This simultaneous 
appearance of water and haemoglobin absorption can be explained 
physiologically, as tissues with higher water content and hence cellular 
content, require improved vascular supply and, thereby, increased blood 
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volume. (22) Since positive t2 scores are related to low tissue density and 
positive scores are related to low tissue scatter, the cluster plot of t1 and t2 
can be divided into quadrants as shown in Figure 11, highlighting the 
relationship between the spectral features and the known physical attributes 
5 of the breast tissue. 

Even though p3 and p4 did not show differentiation between high and 
low density tissue by themselves in a 2-dimensional plot like p1 and p2, areas 
of the component spectra can be interpreted. Additionally p3 shows a lipid 
absorption peak, but water and hemoglobin absorption are almost absent 

10 additionally the lipid peak is shifted towards longer wavelength, so the reason 
therefore is unknown, p4 shows influence from the hemoglobin, with the 
same slope but inverse inflection points to p2. Even though differences in the 
amplitude and general shape of the curves are minimal when compared to p2 
the magnitude of the scores t3 and t4 are much smaller than that of the first 

15 two components- One cannot exclude specifically p4 as a third contributor to 
increase HDM and LDM when using a three dimensional analysis as 
suggested in figure 12. 

Figure 13 shows an example of representing the parenchymal tissue 
density on an inten/al scale (% dense area in the mammogram) rather than a 

20 nominal scale (low, medium, high density). The % dense area can be 

determined using a computer assisted imaging program by a trained observer 
such as for example a radiologist. In the case presented here no trained 
person was utilized and hence the percent density from the mammograms 
contained a large error in the repeat measure which ultimately limits the 

25 accuracy of the presented correlation. However, it serves to indicate that the 
variance reduction programs employed are capable of providing a single 
integer number to the physician monitoring his. patients' health. 

The results disclosed herein show that In vivo optical transillumination 
spectroscopy is a technically feasible method and capable of predicting breast 

30 tissue densities with acceptable correlation to mammographic densities as an 
Indirect method of cancer risk assessment. Thus, transillumination 
spectroscopy may offer a novel 'first step" in the risk assessment of healthy 
women regardless of menstrual cycle, age, ethnic background or menopausal 
status as the data here was not stratified by either event. 
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HDM and LDM values close to or above 90 % are encouraging to 
distinguish between low and high density tissues. These HDM and LDM 
values are higher compared with other physical examinations, such as 
ultrasound (23) and magnetic resonance imaging, (21) reported to be 
between 70-80 %. 

Optical transillumination spectroscopy offers the potential of a real-time 
and cost-effective method compared with ultrasound and has the ability to 
quantify a large range of tissue densities for breasts that are up to 6 cm in 
thicl<ness. An added advantage of transillumination spectroscopy over 
ultrasound and MRI is the fact that results are derived from preset 
mathematical models and, hence, no additional highly trained personnel are 
required for assessment. This reduces the overall cost to the healthcare 
system for this risic-assessment technique. A painless procedure and the 
inherent safety of this method will likely contribute to a high compliance rate. 

X-ray mammography uses Ionizing radiation and is considered 
unacceptable as a tool to assess breast density for women less than forty 
years of age and for frequent measurement. However, transillumination 
spectroscopy Is safe for women of all ages. This allows risk assessment to 
commence at a much younger age when the life style and diet are perhaps 
easier to influence have more time to exert their beneficial effects to reduce 
the risk and could ultimately lead to reduced incidence rates. 

While optical transillumination spectroscopy Is a promising tool to 
monitor the effectiveness of chemopreventlve, dietary or lifestyle studies for 
the reduction of breast cancer risk, its ability to detect physical changes over a 
period of time in the breast tissue of a given individual needs to be 
demonstrated in a proispectlve longitudinal study. 

The predictive value of optical spectroscopy for disease susceptibillly 
quantification can be Increased through additional measurements which can 
Include extending the optical waveband, obtain optical information of 
Interrogated tissue not contributing to the disease risk and obtaining 
Information separately for light scattering. 

The first option Includes the use of wavelength up to 1 .7 |Lim which 
contain among others additional water and lipid absorption bands using ah 
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InGaAs based opto-electronic detector or an equivalent system. This is 
particular of interest for large tissue volumes. If tlie optically interrogated 
tissue volume is small, that is in the range of several mm^, the short 
wavelength band using light of approximately 360 nm to 600 nm may provide 
5 the relevant information. 

For the second option, using a combined transmittance and reflectance 
measurement as shown in Figure 14 enables collection of the optical 
information related to the superficial tissue, here skin, separately through the 
reflectance technique, whereas the transillumination spectroscopy contain 

10 information about the superficial and deep tissue. 

As described above frequency or time domain reflectance and 
transillumination measurements can provide an effective optical path length 
through phase shift or lifetime measurements. The optical path length relative 
to the physical path length between the optodes is related to the light 

15 scattering power of the tissue. Figure 1 5 shows an example of an 

embodiment of a system comprised of three diode lasers to measure the 
breast tissue light scattering power. The remaining part of the spectrum can 
be interpolated as shown in the paper by Cerussl et al (24). 

The present method and apparatus disclosed herein has been 

20 exemplified using breast cancer as the disease of interest which involves 
correlating the associated scalar coefficient of the basis spectral 
component(s) with the pre-selected property of parenchymal breast tissue 
density known to be indicative of susceptibility of breast tissue for breast 
cancer. 

25 It will however be understood that this invention is not restricted to use 

in assessing risk for breast cancer but many other diseases as well. For 
example, the method disclosed herein Is contemplated to be applicable for 
correlating optical information of other mammalian tissue with risk factors 
associated with diseases such as neurodegenerative diseases including 

30 Multiple Sclerosis, Alzheimer's and Parkinson's diseases; oncology including, 
prostate, rectal and testicular cancers, autoimmune disieases including 
Sinustisus, rheumatoid arthritis, and Chron's disease. In each case the 
relevant tissues are optically sampled and the basis spectral component(s) 
are obtained using other techniques such as ultrasound, X-ray analysis, 
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magnetic resonance imaging, potential molecular markers indicating initial 
changes in the tissue, or epidemiologically derived questionnaires proven to 
correlate with the disease of interest as just a couple of examples. The 
present invention is applicable to mammals In general and is not restricted to 
humans. 

As used herein, the terms "comprises", "comprising", "includes" and 
"including" are to be construed as being inclusive and open ended, and not 
exclusive. Specifically, when used in this specification Including claims, the 
terms "comprises", "comprising", "includes" and "including" and variations 
thereof mean the specified features, steps or components are Included. These 
terms are not to be Interpreted to exclude the presence of other features, 
siteps or components. 

The foregoing description of the preferred embodiments of the 
invention has been presented to illustrate the principles of the invention and 
not to limit the Invention to the particular embodiment illustrated. It is intended 
that the scope of the invention be defined by all of the embodiments 
encompassed within the following claims and their equivalents. 
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1 . A method for assessing susceptibility for developing a pre-selected 
disease in a mammal, comprising: 

a) illuminating a volume of selected tissue of a mammal with light 
having wavelengths covering a pre-selected spectral range; 

b) detecting light transmitted through, or reflected from, said volume of 
selected tissue, and obtaining a spectrum of said detected light; 

c) representing the spectrum of detected light by a set of basis spectral 
components, an error term, and an associated scalar coefficient for each 
basis spectral component in said set, the set of basis spectral components 
including at least one basis spectral component, the associated scalar 
coefficient for each basis spectral component being calculated by minimizing 
the error term; and 

d) correlating the associated scalar coefficient for each basis spectral 
component with a pre-selected property of the selected tissue known to be 
indicative of susceptibility of the tissue for the pre-selected disease to obtain 
the susceptibility for the mammal to developing the pre-selected disease. 

2. The method according to claim. 1 wherein the set of bas;is spectral 
components is obtained by the steps of 

obtaining a rang3 of values of the pre-selected property from a 
population cross section which is representative of the mammalian population 
at large, but who are not expressing the pre-selected disease; 

in each mammalian member of said population cross section obtaining 
at least one spectrum by illuminating a volume of selected tissue with liglit 
having wavelengths covering the pre-selected spectral range, and detecting 
light transmitted through, or reflected from, said volume of selected tissue; 

extracting common spectral elements from the at least one spectrum of 
each member of said population cross section and producing at least one set 
of component spectra from the extracted common spectral elements, the at 
least one set of component spectra including at least one component spectra; 
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identifying a set of component spectra which gives the highest 
predictive value for the pre-selected property of the selected tissue known to 
be indicative of susceptibility for the pre-selected disease; and 

using said set of component spectra which gives the highest predictive 
value for the pre-selected property of the selected tissue l<nown to be 
indicative of susceptibility for the pre-selected disease as the set of basis 
spectral components in step c) of claim 1 , 

3. The method according to claim 1 or 2 wherein the step of extracting 
common spectral elements from the at least one spectrum of each member of 
said population cross section and producing at least one set of component 
spectra from the extracted common spectral elements inciudes using one of 
Principle Component Analysis, Partial Least Squares and Principle 
Component Regression. 

4. The method according to claim 1 , 2 or 3 wherein the step of calculating 
the associated scalar coefficient by minimizing the error tenm includes using 
Chi-square fitting to minimize the error term. 

5. The method according to claim 1 , 2, 3 or 4 including repeating steps a) 
to d) inclusively for a particular mammal periodically over a pre-selected 
period of the mammal's lifetime and storing the susceptibiiities calculated 
periodically to detect a rate of change in susceptibility for increased risic for 
the pre-selected disease. 

6. The method according to claim 1 , 2, 3, 4 or 5 wherein said mammal is 
a post puberty human female, and wherein the pre-selected disease is breast 
cancer, and the pre-selected property is parenchymal breast tissue density. 

7. The method according to claim 6 wherein the parenchymal breast 
tissue density is measured by one of x-ray mammography, ultrasound, 
computed tomography, cone beam computed tomography, electrical 
impedance spectroscopy and magnetic resonance imaging. 
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8. The method according to claim 1 wlierein the spectrum of said 
detected light is a continuous spectrum. 

9. The method according to claim 1,2,3, 4, 5, 6, 7 or 8 wherein the 
spectrum of said detected light is a discrete spectrum. 

1 0. ' The method according to claim 1 , 2, 3, 4, 5, 6, 7or 8 wherein the step 
of Illuminating a volume of selected tissue of a mammal with light having 
wavelengths covering a pre-selected spectral range includes Illuminating with 
broad band light, discrete wavelengths or wavelength bands. 

1 1 . The method according to claim 1 , 2, 3, 4, 5, 6, 7, 8, 9 or 10 wherein the 
steps of illuminating a volume of selected tissue and detecting light 
transmitted through, or reflected from, said volume of selected tissue includes 
adjusting an angle and a distance between a position on the tissue surface 
which Is illuminated and a position on the tissue surface where the transmitted 
or reflected light emanates from which is detected. 

12. The method according to claim 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or 1 1 wherein 
said pre-selected disease is selected from the group consisting of 
neurodegenerative diseases, oncology based diseases and autoimmune 
diseases. 

13. The method according to claim 12 wherein said neurodegenerative 
diseases include Multiple Sclerosis, Alzheimer's and Pari^inson's disease, 
and wherein said oncology based diseases include prostate, rectal and 
testicular cancers, and wherein said autoimmune diseases include sinustisus, 
rheumatoid arthritis, and Chron's disease. 

14. The method according to claim 2 wherein the set of basis. spectral 
components includes two or more component spectra. 

15. The method according to claim 1 wherein step d) includes using an 
effective mathematical variance reduction model to identify the basis spectral 
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components which through the associated scalar coefficient shows best 
correlation with the pre-selected property of the selected tissue known to be 
indicative of susceptibility of the tissue for the pre-selected disease. 

16. An apparatus for optical assessing susceptibility for developing a pre- 
selected disease in a mammal, comprising: 

a) holder means for holding and immobilizing an anatomical part of a 
mammal containing tissue to be optically probed; 

b) light source means for illuminating a volume of selected tissue of a 
mammal with light having wavelengths covering a pre-selected spectral 
range; 

c) detection means for detecting light transmitted through, or reflected 
from, said volume of selected tissue; 

d) computer control means connected to said detection means for 
producing a spectrum of said detected light from an output of said detection . 
mean, the computer control means including processing means for 
representing the spectrum of detected light by a set of basis spectral 
components, an error term, and an associated scalar coefficient for each 
spectral component in said set, the set of basis spectral components including 
at least one basis spectral component, the associated scalar coefficient for 
each basis spectral component being calculated by minimizing the error term, 
the processing means includes means for correlating the associated scalar 
coefficient for each basis spectral component with a pre-selected property of 
the selected tissue known to be indicative of susceptibility of the tissue for the 
pre-selected disease to obtain the susceptibility for the mammal to developing . 
the pre-selected disease, the computer control means including display 
means for displaying the susceptibility. 

17. The apparatus according to claim 16 wherein the at least one basis 
spectral component is obtained by the steps of 

obtaining a range of values of the pre-selected property from a 
population cross section wlilch is representative of the mammalian population 
at large, but who are not expressing the pre-selected disease; 
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m each mammalian member of said population cross section obtaining 
at least one spectrum by illuminating a volume of selected tissue with light 
having wavelengths covering the pre-selected spectral range, and detecting 
light transmitted through, or reflected from, said volume of selected tissue; 

extracting common spectral elements from the at least one spectrum of 
each member of said population cross section and producing at least one set 
of component spectra from the extracted common spectral elements, the at 
least one set of component spectra including at least one component spectra; 

identifying a set of component spectra which gives the highest 
predictive value for the pre-selected property of the selected tissue known to 
be indicative of susceptibility for the pre-selected disease; and 

using said set of component spectra which gives the highest predictive 
value for the pre-selected property of the selected tissue Icnown to be 
Indicative of susceptibility for the pre-selected disease as the set of basis 
spectral components In claim 16. 

18. The apparatus according to claim 16 wherein the light source means 
includes either one of a white light source such as a filtered halogen lamp, a 
series of bandwidth limited light emitting diodes (LEDs), several laser light 
sources all emitting within the pre-selected spectral range. 

19. The apparatus according to claim 16 wherein the detection means 
includes either wavelength selective means in conjunction with a charge 
coupled device or a photodiode array all for parallel detection of the pre- 
selected wavelength band or an avalanche photodiode or point photo detector 
for sequential detection of the pre-selected wavelength band. 

20. The apparatus according to claim 16, 17, 18 or 19 wherein the 
processing means includes calibration means for measuring a wavelength 
dependent instrument transfer function, and wherein said processing means 
includes means for removing instrumental affects on the spectrum of detected 
light. 
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21 . The apparatus according to claim 19 wherein the calibration means for 
measuring a wavelength dependent instrument transfer function includes a 
pre-selected volume of material which preferentially scatters light rather than 
absorbing light, positioned in place of the pre-selected tissue, 

22. The apparatus according to claim 21 wherein the material which 
preferentially scatters light rather than absorbing light is comprised of highly 
density poly-urethane. 

23. The apparatus according to claim 16 wherein said detection means is 
positioned to detect light reflected from the tissue being Illuminated, including 
means for adjusting the angle and distance between the light source means 
and this detection means. 

24. The apparatus according to claim 16, 17, 18, 19, 20, 21 , 22 or 23 
wherein the light source means includes a light source and an optical fiber 
with a proximal end positioned to couple light into said optical fiber and a 
distal end from which light emanates from the fiber positioned so as to direct 
light onto the pre-selected tissue. 

25. The apparatus according to claim 16, 17, 18, 19, 20, 21, 22, or 23 
wherein the mammal is a post puberty human female, and wherein the pre- 
selected disease is breast cancer, and wherein the holder means includes a 
breast support including at least one illumination means and at least one 
detection means for each breast. 
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